Traditional and context-specific spam detection in low resource settings
نویسندگان
چکیده
Social media data has a mix of high and low-quality content. One form commonly studied content is spam. Most studies assume that spam context-neutral. We show on different Twitter sets context-specific exists identifiable. then compare multiple traditional machine learning models neural network model uses pre-trained BERT language to capture contextual features for identifying spam, both context-specific, using only content-based features. The outperforms the with an F1 score 0.91. Because training are notoriously imbalanced, we also investigate impact this imbalance simple Bag-of-Words best extreme imbalance, but fine-tunes from other domains significantly improves score, not levels domain-specific models. This suggests strategy employed may vary depending upon level in set, amount available low resource setting, prevalence vs. Finally, make our use by research community.
منابع مشابه
The Continued Utility and Viability of Dakin’s Solution in Both High- and Low-resource Settings
Healthcare is expensive and often inaccessible to many. As a result, surgeons must consider simple, less expensiveinterventions when possible. For wound care, an older but quite effective cleaning agent is Dakin’s solution (0.5%sodium hypochlorite), an easily made mixture of 100 milliliters (ml) bleach with 8 teaspoons (tsp) baking soda into agallon of clean water or 25 ml ble...
متن کاملTelemedicine in Low-Resource Settings
Telemedicine is a fuzzy term with several synonyms (telehealth, e-health, etc), which cover a wide range of topics, all concerning the delivery of health care at a distance. “Health care” itself is a broad concept, encompassing diagnosis and treatment of patients, education of staff, patients, and the general public, and administrative activities, such as collecting public health data, as well ...
متن کاملSpam Detection on Twitter Using Traditional Classifiers
Social networking sites have become very popular in recent years. Users use them to find new friends, updates their existing friends with their latest thoughts and activities. Among these sites, Twitter is the fastest growing site. Its popularity also attracts many spammers to infiltrate legitimate users’ accounts with a large amount of spam messages. In this paper, we discuss some userbased an...
متن کاملHow e-learning creates new opportunities in hospital settings? Innovations in a low resource setting
Background: E-learning and telemedicine have become common methods in changing and developing medical education and clinical processes. The purpose of this study was to describe the innovations of blending e-Learning into the educational and medical processes of hospital services Methods: The process of action research included plan, act, observation and reflection was followed. Implementation,...
متن کاملCervical cancer prevention in low-resource settings.
Objective: To help care providers understand the current status of cervical cancer in low-resource countries. Options: The most effective and practical options for cervical screening and treatment in low-resource countries are evaluated. Outcomes: Improvement in rates of prevention and early detection of cervical cancer in low-resource countries. Evidence: PubMed or Medline, CINAHL, and The Coc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Machine Learning
سال: 2022
ISSN: ['0885-6125', '1573-0565']
DOI: https://doi.org/10.1007/s10994-022-06176-x